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BRIEF OF APPELLANTS 

MAIL STOP APPEAL BRIEF - PATENTS 
Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA22313-1450 

Dear Sir: 

In accordance with 37 CFR §41 .37, Appellants* attorney hereby submits the Brief of 
Appellants on appeal from the final rejection in the above-identified application as set forth in the 
Office Action dated July 12, 2004. 

Please charge the amount of $500.00 to cover the required fee for filing this Brief as set 
forth under 37 CFR §41.20(b)(2) to Deposit Account No. 50-1673 of NCR Corporation, the 
assignee of the present application. Also, please charge any additional fees or credit any 
overpayments to Deposit Account No. 50-1673. 

I. REAL PARTY IN INTEREST 

The real party in interest is NCR Corporation, the assignee of the present application. 
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H. KFT.ATED APPEALS AND INTERFERENCES 

There is a related appeal for Application Serial No. 09/410,528, filed on October 1, 1999, by 
Brian D. Tate, James E. Pricer, Tej Anand, and Randy G. Kerber, entitled SQL-Based Analytic 
Algorithm for Association, attorney's docket number 8219. 

There is a related appeal for Application Serial No. 09/410,531, filed on October 1, 1999, by 
James D. HMreth, entitled SQL-Based Analytic Algorithm for Clustering, attorney's docket number 
8220. 

There is a related appeal for Application Serial No. 09,410,530, filed on October 1, 1999, by 
Todd M. Brye, entitled SQL-Based Analytic Algorithm for Rule Induction, attorney's docket number 
8221. 

HI. STATl K OF CLAIMS 

Haimt 1-23 have been canceled. 

Claims 24-83 are pending in the application. 

Claims 24-34, 36-40, and 42-43 were rejected under 35 U.S.C §102(e) as being anticipated by 
Iyer et al., U.S. Patent No. 5,899,992 (Iyer). 

Claim 35 was rejected under 35 US.C §l03(a) as being unpatentable over Iyer in view of 
SAS Institute Inc., SAS OnlineDoc® , Version 8, Guy, NO SAS Institute Inc., (SAS). 

Claim 41 was rejected under 35 US.C §103(a) as being unpatentable over Iyer in view of 
Shafer et aL, SPRINT: A Scalable Parallel Classifier for Data Mining, Proceeding of the 22 nd V1DB 
Conference Mumbai, 1996 (Shafer). 

Claims 44 and 45 were rejected under 35 US.C §103 (a) as being unpatentable over Iyer in 
view of Bridges, US. Parent No- 5,548,770 (Bridges). 

Claims 46-88 were stated to "add no novelty, 55 and thus were "rejected under the same 
rationale as the base claim and each foregoing claim aforementioned," although no formal rejections 
were made. Moreover, it is noted that the application only contains claims numbered to 83, not 88. 

Claims 24-83 are being appealed. 
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IV. STATUS OF AMENDMENTS 

An amendment under 37 CF.R. §1.116 was submitted on September 13, 2004 in response to 
the final Office Action, wherein claims 33, 39, 52, 54, 56, 58, 60, 71, 73, 75, 77 and 79 were amended 
An Advisory Action mailed October 13, 2004 did not indicate that the amendments were being 
entered for the purposes of appeaL Record is made of a telephone interview between the below 
signed Attorney and Examiner Holmes that took place on November 12, 2004, wherein the below- 
signed Attorney inquired as to the status of the claim amendments. Thereafter, a Supplemental 
Advisory Action was mailed on November 23, 2004, which included an Interview Summary, 
indicating that the amendments were being entered for the purposes of appeaL 

V. ST JMMARY OF THE INVENTION 

Appellants' independent claims 24, 44 and 45 are generally directed to computer- 
implemented system for performing data rnining applications. 

Cairn 24 recites a computer-implemented system for performing data mining applications. 
The system includes a computer having one or more data storage devices connected thereto, 
wherein a relational database is stored on one or more of the data storage devices. The system also 
includes a relational database management system, executed by the computer, for accessing the 
relational database stored on the data storage devices. Finally, the system includes an analytic 
application programming interface (API) that generates a set of scalable data mining functions 
including queries for execution by the relational database management system, executed by the 
computer, for performing data mining operations directly within the database management system. 

Claim 44 recites a method for performing data mining applications. The method includes 
the steps of storing a relational database on one or more data storage devices connected to a 
computer, accessing the relational database stored on the data storage devices using a relational 
database management system, and executing an analytic application progr amm ing interface (API) 
that generates a set of scalable data mining functions including queries for execution by the 
relational database management system, for performing data mining operations directly within the 
database management system. 

Claim 45 recites an article of manufacture comprising logic embodying a method for 
performing data mining applications. The method includes the steps of storing a relational database 
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on one or more data storage devices connected to a computer, accessing the relational database 
stored on the data storage devices using a relational database management system, and executing an 
analytic application programming interface (API) that generates a set of scalable data mining 
functions including queries for execution by the relational database management system, for 
performing data mining operations directly within the database management system. 

With, regard to the claims, Appellants 1 attorney requests that the Board refer to the 
specification generally. Specific portions of the specification that directly relate to the claims on 
appeal include: 

(a) at page 6, line 6 through page 6, line 26, and in FIG. 1 as reference numbers 100, 102, 
104, 106, 108, 110, 112, 114, 116 and 118; 

(b) at page 7, lines 5-9; 

(c) at page 7, lines 13-18, and in FIG. 2 as reference numbers 202 and 204; 

(d) at page 8, lines 11-15, and in FIGS. 1 and 2 as reference numbers 110, 114, 118, 200, 202 
and 204; 

(e) at page 9, line 14 through page 15, line 16; and 

(f) at page 16, line 33 through page 18, line 15, in FIGS. 3, 4 and 5 as reference numbers 
300-308, 400-410 and 500-512. 

VI. GROUNDS OF RETECTTON TO BE REVIEWED ON APPEAL 

1. Whether claims 24-34, 36-40, and 42-43 are anticipated under 35 U.S.G §102(e) by 
Iyer et aL, US. Patent No. 5,899,992 (Iyer). 

2. Whether claim 35 is obvious under 35 U.S.G §103 (a) over Iyer in view of SAS 
Institute Inc., SAS CJrJineDoe* , Version 8, Gary, NG SAS Institute Inc., (SAS). 

3. Whether claim41 is obvious under 35 US.C §l03(a) over Iyer in view of Shafer et 
aL, SPRINT: A Scalable Parallel Classifier for Data Mining, Proceeding of the 22 nd VLDB 
Conference Mumbai, 1996 (Shafer). 

4. Whether claims 44 and 45 are obvious under 35 U.S.G §103(a) over Iyer in view of 
Bridges, US. Patent No. 5,548,770 (Bridges). 

5. Whether <-W<™<i 46-88 "add no novelty," and thus are anticipated and obvious "under 
the same rationale as the base claim and each foregoing claim aforementioned." 
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VII. rainf , 1PING OF CLAIMS 

The rejected claims do not stand or fall together. Instead, the claims are grouped as follows: 

(a) claims 24, 26, 42, 44, 45, 47, 63, 66, 82 stand or fall together, 

(b) claims 25, 46 and 65 stand or fall together, 

(c) claims 27, 48 and 67 stand or fall together, 

(d) claims 28-31, 35-41, 49-52, 56-62, 68-71, 75-81 stand or fall together, 

(e) claims 32, 53 and 72 stand or fall together, 

(f) claims 33, 54 and 73 stand or fall together, 

(g) claims 34, 55 and 74 stand or fall together, and 

(h) claims 43, 64 and 83 stand or fall together. 

Separate arguments for each of the groups of claims are provided below. 

VIII. ARGUMENTS 

A. The Office Action Rejections 

In paragraphs (8)-(9) of the Office Action, claims 24-34, 36-40, and 42-43 were rejected 
under 35 U.S.C §102(e) as being anticipated by Iyer et al., US. Patent No. 5,899,992 (Iyer). In 
paragraphs (9)-(l0) of the Office Action, claim 35 was rejected under 35 U.S.C §l03(a) as being 
unpatentable over Iyer in view of SAS Institute Inc., SAS OnlineDoc® , Version 8, Gary, NG SAS 
Institute Inc., (SAS). In paragraphs (ll)-(12) of the Office Action, claim 41 was rejected under 35 
US.C §103(a) as being unpatentable over Iyer in view of Shafer et aL, SPRINT: A Scalable Parallel 
Oassifier for Data Mining, Proceeding of the 22 nd VLDB Conference Mumbai, 1996 (Shafer). In 
paragraphs (13)-(14) of the Office Action, claims 44 and 45 were rejected under 35 US.C §103(a) as 
being unpatentable over Iyer in view of Bridges, US. Patent No. 5,548/70 (Bridges). In paragraph 
(14) of the Office Action, claims 46-88 were stated to "add no novelty," and thus were "rejected 
under the same rationale as the base claim and each foregoing claim aforementioned," although no 

formal rejections were made. 

Appellants' attorney respectfully traverses these rejections. Moreover, Appellants' attorney 
notes that the present application only contains claims numbered to 83, not 88. 
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B. Hie Appellants' Indep endent Claims 

Appellants' independent claims 24, 44 and 45 are generally directed to computer- 
implemented system for performing data mining applications. Oaim 1 is representative and 

comprises the elements of: 

(a) a computer having one or more data storage devices connected thereto, wherein a 
relational database is stored on one or more of the data storage devices; 

(b) a relational database management system, executed by the computer, for accessing the 
relational database stored on the data storage devices; and 

(c) an analytic application programming interface (API) that generates a set of scalable data 
mining functions including queries for execution by the relational database management system, 
executed by the computer, for performing data mining operations directly within the database 
management system. 

C Th* Tver Reference 

Iyer describes a method, apparatus, and article of manufacture for a computer implemented 
scaleable set-oriented classifier. Hie scalable set-oriented classifier stores set-oriented data as a table 
in a relational database. The table is comprised of rows having attributes. The scalable set-oriented 
classifier classifies the rows by building a classification tree. The scalable set-oriented classifier 
determines a gini index value for each split value of each attribute for each node that can be 
partitioned in the classification tree. The scalable set-oriented classifier selects an attribute and a split 
value for each node that can be partitioned based on the determined gini index value corresponding 
to the split value. Then, the scalable set-oriented classifier grows the classification tree by another 
level based on the selected attribute and split value for each node. The scalable set-oriented classifier 
repeats this process until each row of the table has been classified in the classification tree. 

D. The SAS Reference 

SAS describes a correlation matrix. The correlation matrix table contains Pearson product- 
moment correlations of Y variables. Correlation measures the strength of the linear relationship 
between two variables. 
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E. Th^ Shafw- Reference 

Shafer describes a scalable parallel classifier for data mining. A decision-tree-based 
classification algorithm, called SPRINT, removes all memory restrictions and is fast and scalable. 

F. T V Bridges Reference 

Bridges describes an indexing system and method for improving retrieval of data based on a 
query from a user from a database management system including a main computer and a memory 
coupled to the main computer for storing the data. The indexing system includes a parallel computer 
coupled to the main computer and a parallel disk array coupled to the parallel computer. The 
invention includes the steps of storing record based data in the memory of the database 
management system, storing a value based index of selected attributes related to the record based 
data on the parallel disk array, and detemiining whether the parallel computer can be used to execute 
a query to obtain at least a partial result to the query. If so, the query is sent to the parallel computer 
and the query is executed on the parallel computer to obtain at least a partial result. If a final result 
cannot be determined on the parallel computer, the partial result from the parallel computer is sent 
to the database management system and a final result is determined on the database management 
system using the partial result received from the parallel computer. 



G. Arguments Directed To The First Group s fnr Rejection; Whether aaims 24-3A 

16-40. and 42-4^ An- Anticipated Under 35 US C. fl02(d Bv Tver etal.. US, Patent 
NTn. 5.899.992 fiver). 
1 Claims 24-34. 36-40 and 42-43 
Appellants' attorney respectfully submits that Appellants' claims 24-34, 36-40 and 42-43 are 
patentable over the cited reference, because me reference does not teach or suggest the specific 
combination of elements recited in Appellants' independent claims. 

Specifically, the reference does not teach or suggest the limitation of claim 24 directed to "an 
analytic application prograrnming interface (API) that generates a set of scalable data mining 
functions including queries for execution by the relational database management system, executed by 
the computer, for performing data mining operations directly within the database management 
system." 
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The Office Action asserts that Iyer teaches these limitations of Appellants 5 claims at cc 
line 50 through coL 4, line 26 (actually coL 4, line 35), which is set forth below 

The scalable set-oriented classifier 114 of the present invention resorts to 
proven scalable database technology to provide a generic solution to the 
classification problem of scdabUhy. The present invention provides a scalable model 
for classifying rows of a table within a classification tree. Ike scalable set-oriented 
classifier 114 is called the Scalable Supervised Learning Irregardless of Memory 
(SUM) Classifier 114. Not only is the SLIM classifier 114 scalable in regions where 
recently published classifiers are not, but by virtue of building on well known set- 
oriented database management system pBMS) primitives, the SUM classifier 114 
instantly exploits several decades of database research and development. Hie present 
invention rephrases classification, a data mining method, into analysis of data in a 
star schema, formalizing further the interrelationship between data mining and data 
warehousing. 

A description of a prototype built using IBM's DB2 product as the RDBMS 
108, and experimental results for the prototype are discussed below. Generally, the 
experimental results indicate that the DB2-based SUM classifier 114 has desirable 
properties associating it with linear scalability. 

The SUM classifier 114 is built based on a set-oriented access to data 
paradigm. The SUM classifier 114 uses Structured Query Language (SQL), offered 
by most commercial RDBMS 108 vendors, as the basis for the method The SUM 
classifier 114 is based on well known database methodologies and lets the RDBMS 
108 automatically handle scalability. As a result, the SUM classifier 114 will scale as 
long as the database scales. 

The SUM classifier 1 14 leverages the Structured Query Language (SQL) 
Application Programming Interface (API) of the RDBMS 108, which exploits the 
benefits of many years research and development pertaining to: 

(1) scalability 

(2) memory hierarchy 

(3) parallelism 0;i8]) 

(4) optimization of the executions([16]) 

(5) platform independence 

(6) client server API ([17]). 

See S. Sarawagi, Query Processing in Tertiary Memory Databases, VLDB 
1995, [hereinafter Sarawag]; S. Sarawagi and M. Stonebraker, Benefits of Reordering 
Execurion in Tertiary Memory Databases, VLDB 1996, [[hereinafter Stonebraker]; 
G. Bhargava, P. Goel, and B. Iyer, Hypergraph Based Reordering of Outer Join 
Queries with Complex Predicates, SIGMOD 1995, [hereinafter Bhargava]; T. 
Nguyen and V. Srinivasan, Accessing Relational Databases from the World Wide 
Web, SIGMOD 1996, [hereinafter Goel]; G K. Bam et. aL, DB2 Parallel Edition, 
IBM Systems Journal, Vol 34, No 2, 1995, [hereinafter Baru]; each of which is which 
is incorporated by reference herein. 
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Appellants' attorney disagrees with this analysis. 

The only API discussed in the above portions of Iyer is the Structured Query Language 
(SQL) Application Programming Interface (API) of the relational database management system 
(RDBMS). However, nothing in this description teaches or suggests "an analytic application 
programming interface (API) that generates a set of scalable data mining functions including queries 
for execution by the relational database management system, executed by the computer, for 
performing data mining operations directly.within the database management system." Instead, this 
API of the RDBMS in Iyer only invokes functions of the RDBMS, but says nothing about 
generating a set of scalable data raining functions as recited in Appellants' claims. Moreover, the 
scalable set-oriented classifier of Iyer is not analogous to Appellants' claimed analytic API. 

Indeed, the above portions of Iyer do not provide a proper basis for rejecting claims 24-43, 
because nowhere is Iyer properly applied to the limitations of claims 24-43. Instead, the Office 
Action relies on general conclusoty statements regarding Iyer to reject Appellants' claims, without 
addressing the specific limitations of those claims or the specific teachings of Iyer. 

Moreover, Appellants' claimed invention provides operational advantages over the system 
disclosed in Iyer. Moreover, Appellants' claimed invention solves problems not recognized by Iyer. 

Thus, Appellants' attorney submits that independent claim 24 is allowable over Iyer. 
Further, dependent claims 25-34, 36-40 and 42-43 are submitted to be allowable over Iyer in the 
same manner, because they are dependent on independent claim 24, and thus contain all the 
limitations of the independent claim. 

Dependent claims 25-34, 36-40 and 42-43 recite additional novel elements not shown by 
Iyer. The arguments set forth below are directed not only to claims 25-34, 36-40 and 42-43, but to 
their grouped counterpart claims as well* 

2. Claims 25. 46 and 65 
With regard to grouped dependent claims 25, 46 and 65, which recite that the computer 
comprises a parallel processing computer comprised of a plurality of nodes, and each node executes 
one or more threads of the relational database management system to provide parallelism in the data 
mining operations, the Office Action asserts that these limitations are taught by Iyer Appellants' 
attorney disagrees. At the indicated location, Iyer refers to nodes of a classification tree, not the 
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nodes of a parallel processing computer that executes threads of a relational database management 
system. 

% Claims 2fi ? 47 and 66 
With regard to grouped dependent claims 26, 47 and 66, which recite that the scalable data 
mining functions process data collections stored in the relational database and produce results that 
are stored in the relational database, these claims stand or fall with claims 24, 44 and 45, respectively. 

d rhim q 27. 48 and 67 
With regard to grouped dependent claims 27, 48 and 67, which recite that the scalable data 
mining functions are created by parameterizing and instantiating the analytic API, the Office Action 
asserts that these limitations are taught by Iyer. Appellants' attorney disagrees. At the indicated 
location, Iyer refers to an SQL API, not an analytic API for creating scalable data mining functions. 

5. Claims 28. 49 and 68 

With regard to grouped dependent claims 28, 49 and 68 3 which recite that the scalable data 
mining functions are dynamically generated queries comprised of combined phrases with 
substituting values therein based on parameters supplied to the analytic API, the Office Action 
asserts that these limitations are taught by Iyer. Appellants' attorney disagrees. Ar the indicated 
location, Iyer refers to an SQL API, not an analytic API for creating scalable data mining functions, 
and thus does not dynamically generate queries comprised of combined phrases with substituting 
values therein based on parameters supplied to the analytic APL 

6. Claims 29. 50 and 69 

With regard to grouped dependent claims 29, 50 and 69, which recite that the scalable data 
mining functions comprise Data Description functions, Data Derivation functions, Data Reduction 
functions, Data Reorganization functions, Data Sampling functions, or Data Partitioning functions, 
these claims stand or fall with claims 28, 49 and 68, respectively. 
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7. Claim* 30.51 and 70 
With regard ro grouped dependent claims 30, 51 and 70, which recite that the Data 
Description functions comprise descriptive statistical functions, these claims stand or fell with claims 
29, 50 and 69, respectively. 

a ngf'ms 31. 52 and 71 
With regard to grouped dependent claims 31, 52 and 71, which recite that the Data 

Description functions comprise: 

(1) descriptive statistics for one or more numeric columns, wherein the statistics are 
selected from a group comprising count, minimum, maximum, mean, standard 
deviation, standard mean error, variance, coefficient of variance, skewness, kurtosis, 
uncorrected sum of squares, corrected sum of squares, and quantiles, 

(2) a count of values for a column, 

(3) a calculated modality for a column, 

(4) one or more bin numeric columns of counts with overlay and statistics options, 

(5) one or more automatically sub- binned numeric columns giving additional counts and 
isolated frequendy occurring individual values, 

(6) a computed frequency of one or more column values, 

(7) a computed frequency of values for pairs of columns in a column list, 

(8) a Pearson Product-Moment Correlation matrix, 

(9) a Covariance matrix, 

(10) a sum of squares and cross-products matrix, or 

(1 1) a count of overlapping column values in one or more combinations of tables, 
these claims stand or fall with claims 29, 50 and 69, respectively. 

9^ Harms 32. 53 and 72 
Whh regard to grouped dependent claims 32, 53 and 72, which recite that the Data 
Derivation functions provide column derivations or transformations, the Office Action asserts that 
these limitations are taught by Iyer. Appellants' attorney disagrees. At the indicated location, Iyer 
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merely describes reforming table names, but says nothing about column derivations or 
transformations. 

10 Halms 33. 54 and 73 
With regard to grouped dependent claims 33, 54 and 73, which recite that the Data 
Derivation functions comprise: 

(1) a derived binned numeric column wherein a new column is bin number, 

(2) a n- valued categorical column durnrny^coded into V 0/ 1 values, 

(3) a n- valued categorical column recoded into n or less new values, 

(4) one or more numeric columns scaled via range transforation, 

(5) one or more columns scaled to a Zrscore that is a number of standard deviations 
from a mean, 

(6) one or more numeric columns scaled via a sigmoidal transformation function, 

(7) one or more numeric columns scaled via a base 10 logarithm f unction, 

(8) one or more numeric columns scaled via a natural logarithm function, 

(9) one or more numeric columns scaled via an exponential function, 

(10) one or more numeric columns raised to a specified power, 

(1 1) one or more numeric columns derived via user defined transformation function, 

(12) one or more new columns derived by ranking one or more columns or expressions 
based on order, 

(13) one or more new columns derived with quantile 0 to n- 1 based on order and n, 

(14) a cuntuktive sum of a value expression based on a son expression, 

(15) a moving average of a value expression based on a width and order, 

(16) a moving sum of a value expression based on a width and order, 

(17) a moving difference of a value expression based on a width and order, 

(18) a moving linear regression value derived from an expression, width, and order, 

(19) a multiple account/ product ownership bitmap, 

(20) a product ownership bitmap over multiple time periods, 

(2 1) one or more counts, amount, percentage rneans and intensities derived from a 
transaction summary, 
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(22) one or more variabilities derived from transaction summary data, 

(23) one or more derived trigonometric values and their inverses* including sin, arcsm, 
cos, arccos, esc, arccsc, sec, aitsec, tan, arctan, cot, and arccot, or 

(24) one or more derived hyperbolic values and their inverses, including sinh, arcsinh, 
cosh, arccosh, csch, arccsch, sech, arcsech, taoh, arctanh, coth, and arccoth, 

the Office Action asserts that the limitations of element (12) are taught by Iyer, Appellants' attorney 
disagrees. Ax the indicated location, Iyer merely describes forming groupings, not new columns 
derived by ranking one or more columns or expressions based on order. 

1 1 Hai'ms 34, ft5 and 74 
With regard to grouped dependent claims 34, 55 and 74, which recite that the Data 
Reduction functions provide matrix building operations to reduce the amount of data required for 
analytic algorithms, the Office Action asserts that these limitations are taught by Iyer. Appellants 1 
attorney disagrees. At the indicated location, Iyer merely describes the use of a "count matrix", but 
says nothing about matrix building operations that reduce the amount of data required for analytic 
algorithms. 

1? riaWl^ g7 and 76 
With regard to grouped dependent claims 36, 57 and 76, which recite that the Data 
Reorganization functions provide an ability to reorganize data by joining or de-nor malizin g pre- 
processed results into a wide analytic data set, these claims stand or fall with claims 29, 50 and 69, 
respectively. 

n Hai ms 37. 58 and 77 
With regard to grouped dependent claims 37, 58 and 77, which recite that the Data 
Reorganization functions comprise; 

(1) create a de-normalized new table by removing one or more key columns, or 

(2) join a plurality of tables or views into a combined result table, 
these claims stand or fall with claims 29, 50 and 69, respectively. 



13 



PAGE 16/39 * RCVD AT 1/12/2005 6:57:10 PM pastern Standard Time] * SVR:USPT0-EFXRM/2 * DNIS:8729306 * CSID:+1 31 0641 8798 * DURATION (mm-ss):10-26 



01-1 2-2005 04:13PM F ROM-Gates & Cooper LLP 



413106418798 T-895 P. 017/039 F-062 



14 Claims 38. 59 and 78 
With regard to grouped dependent claims 38, 59 and 78, which recite that the Data Sampling 
function provides an ability to construct a new table containing a randomly selected subset of the 
rows in an existing table or view, these claims stand or fall with claims 29, 50 and 69, respectively. 

is n«in*;39.60and79 
With regard to grouped dependent claims 39, 60 and 79, which recite that the Data Sample 
function selects one or more data samples of specified sizes from a table these claims stand or fall 
with claims 29, 50 and 69, respectively. 

1A Claim* 40.61 and 80 
With regard to grouped dependent claims 40, 61 and 80, which recite that the Data 
Pamrioning function provides an ability to construct a new table containing at least one randomly 
selected subset of the rows in an existing table or view, wherein the subsets are mutually distinct but 
all-inclusive subsets of data, these claims stand or fall with claims 29, 50 and 69, respectively. 



17 n^ ms 42. 63 and 82 
With regard to grouped dependent claims 42, 63 and 82, which recite that results of the data 
mining operations are stored in the relational databases, these claims stand or fall with claims 24, 44 
and 45, respectively. 

1« n a ;™ 43. 64 and 83 
With regard to grouped dependent claims 43, 64 and 83, which recite that the relational 
database management system further comprises an analydcal logical data model that stores metadata 
and processing results from the Scalable Data Mining Functions, the Office Action asserts that these 
limitations are taught by Iyer. Appellants' attorney disagrees. At the indicated location, Iyer says 
nothing about an analytical logical data model that stores metadata and processing results from the 
scalable data mining functions, but instead merely refers to a tr aining set and leaf node list. 



14 



PAGE 17/39 1 RCVD AT 1/1212005 6:57:10 PM [Eastern Standard Time] * SVR:USPT0-EFXRF-1/2 * DN1S:8729306 1 CS1D:+13106418798 * DURATION (mm-ss):10-26 



01-12-2005 04:13PM 



FROW-Gatss & Cooper LLP 



+13106418798 



T-895 P. 01 8/039 F-062 



R Ar guments Dii*W To The Second Grounds for Rejection: Wh rthpr Claim }S Is 
OW»m Under 35 T LS.C $103 to Over Iver In View Of gA S Institute frg. 
1 n *\ nvi 35. 56 and 75 

The arguments set forth below are directed not only to claim 35, but to its grouped 
counterpart claims as well. 

With regard to grouped dependent claims 35, 56 and 75, which recite that the Data 

Reduction functions comprise: 

(1) build one or more data reduction matrices from a group comprising: (i) a Pearson- 
Product Moment Correlations matrix; (ii) a Covariances matrix; and (iii) a Sum of 
Squares and Gross Products (SSCP) matrix, 

(2) export a resultant matrix, or 

(3) restart a matrix operation, 

these claims stand or fall with claims 29, 50 and 69, respectively* 

L Ar guments Directed To The Third Grounds for Rejection: Whether Claim 41 _Is 
Obvious Over Iver In View Of Shafer. 

The arguments set forth below are directed not only to claim 41, but to its grouped 
counterpart claims as well. 

With regard to grouped dependent claims 41, 62 and 81, which recite that the Data 
Partkioning function selects one or more data partitions from a table using a database internal 
hashing technique, these claims stand or fall with claims 29, 50 and 69, respectively. 

J. Ar guments Directed To The Fourth G round* for Rejection: Whether Claims 44 and 
45 Are Obvious Over Iyer In view Of Bridges 

_L Claims 44 and 45 

Appellants' attorney respectfully submits that Appellants' claims 44 and 45 are patentable 
over the cited references, because the references do not teach or suggest the specific combination of 
elements recited in Appellants' independent claims. 
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Specifically, the references do not teach or suggest the limitations of claims 44 and 45 
directed to "an analytic application programming interface (API) that generates a set of scalable data 
mining functions including queries for execution by the relational database management system, 
executed by the computer, for performing data mining operations directly within the database 

management system." 

The Office Action fails to specifically address this limitation, but Appellants' attorney 
assumes that the Office Action intended a rejection similar to that made against claim 24, herein 
the Office Action asserts that Iyer teaches these limitations of Appellants* claims at col. 3, line 50 
through coL 4, line 26 (actually coL 4, line 35), which is set forth below: 

The scalable set-oriented classifier 114 of the present invention resorts to 
proven scalable database technology to provide a generic solution to the 
classification problem of scalability. The present invention provides a scalable model 
for classifying rows of a table within a classification tree. The scalable set-oriented 
classifier 114 is called the Scalable Supervised Learning Irregardless of Memory 
(SLIM) Classifier 1 14. Not only is the SLIM classifier 114 scalable in regions where 
recendy published classifiers are not, but by virtue of building on well known set- 
oriented database management system pBMS) primitives, the SLIM classifier 114 
instantly exploits several decades of database research and development. The present 
invention rephrases classification, a data mining method, into analysis of data in a 
star schema, formalizing further tile interrelationship between data mining and data 
warehousing. 

A description of a prototype built using IBM's DB2 product as the RDBMS 
108, and experimental results for the prototype are discussed below. Generally, the 
experimental results indicate that the DB2-based SUM classifier 114 has desirable 
properties associating it with linear scalability. 

The SLIM classifier 114 is built based on a set-oriented access to data 
paradigm- The SUM classifier 114 uses Structured Query Language (SQL), offered 
by most commercial RDBMS 108 vendors, as the basis for the method. The SUM 
classifier 114 is based on well known database methodologies and lets the RDBMS 
108 automatically handle scalability As a result, the SLIM classifier 1 14 will scale as 
long as the database scales. 

The SUM classifier 114 leverages the Structured Query Language (SQL) 
Application Programming Interface (API) of the RDBMS 108, which exploits the 
benefits of many years research and development pertaining to: 

(1) scalabflity 

(2) memory hierarchy 

(3) parallelism ([18]) 

(4) optimization of the execurioris(£l6]) 

(5) platform independence 
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(6) cUent server API £171). , xrTTM3 

See S. Sarawagi, Que ty Processing in Tertiary Memory Databases, VUJfl 
1995 [hereinafter Sarawagi S- Sarawagi and M. Stonebraker, Benefits of Reordering 
Exertion in Tertiary Memory Databases, VLDB 1996, [[hereinafter Stonebraker]; 
G Bhargava, P. Goel, arid B. Iyer, Hypergraph Based Reordering of Outer Join 
Queries with Complex Predicates, SIGMOD 1995, [hereinafter BhargavaJ X 
Nguyen and V. Srinivasan, Accessing Relational Databases from the World Wide 
Web SIGMOD 1996, [hereinafter Goel]; C K. Barn eL al, DB2 Parallel Edition, 
IBM Systems Journal, VoL 34, No 2, 1995, [hereinafter Barufc each of which is which 
is incorporated by reference herein. 

Appellants' attorney disagrees with this analysis. 

The only API discussed in the above portions of Iyer is the Structured Query Language 
(SQL) Application Programming Interface (API) of the relational database management system 
(RDBMS). However, nothing in this description teaches or suggests "an analytic application 
programming interface (API) that generates a set of scalable data mining functions including queries 
for execution by the relational database management system, executed by the computer, for 
performing data mining operations directly within the database management system." Instead, this 
API of the RDBMS in Iyer only invokes functions of the RDBMS, but says nothing about 
generating a set of scalable data mining functions as recited in Appellants' claims. Moreover, the 
scalable set-oriented classifier of Iyer is not analogous to Appellants 5 claimed analytic APL 

Indeed, the above portions of Iyer do not provide a proper basis for rejecting claims 44 and 
45, because nowhere is Iyer properly applied to the limitations of claims 44 and 45. Instead, the 
Office Action relies on general conchisory statements regarding Iyer to reject Appellants' claims, 
without addressing the specific limitations of those claims or the specific teachings of Iyer. 

Further, Bridges fails to overcome the deficiencies of Iyer* Recall that Bridges was cited only 
for a parallel computer system, and thus is not related to these limitations. 

Appellants* claimed invention provides operational advantages over the system disclosed in 
the various references. Moreover, Appellants* claimed invention solves problems not recognized by 
the cited references. 

Thus, Appellants' attorney submits that independent claims 44, and 45 are allowable over 
Iyer and Bridges. 
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K. A liments DiierrM To The Fift h r.r™^ fnr B ^rrinii: Wherher Claims 46-88 
Are Anticipated or Obvious 

Appellants" attorney submits dependent claims 46-88 are allowable over the references, 
because they are dependent on independent claims 44 and 45, respectively, and thus contain all the 
limitations of the independent claims. Moreover, dependent claims 46-83 recite additional novel 
elements not shown by die references. 

The arguments set forth below are directed not only to claims 46-88, but to their grouped 

counterpart claims as well. 

1. Claims 25. 46 and 65 



With regard to grouped dependent claims 25, 46 and 65, which recite that the computer 
comprises a parallel processing computer comprised of a plurality of nodes, and each node executes 
one or more threads of the relational database management system to provide parallelism in the data 
mining operations, the Office Action asserts that these limitations are taught by Iyer. Appellants' 
attorney disagrees. At the indicated location, Iyer refers to nodes of a classification tree, not the 
nodes of a parallel processing computer that executes threads of a relational database management 
system. 

7 C lai ms 26. 47 and 66 
With regard to grouped dependent claims 26, 47 and 66, which recite that the scalable data 
mining functions process data collections stored in the relational database and produce results that 
are stored in the relational database, these claims stand or fall with claims 24, 44 and 45, respectively. 

3. Claims 27. 48 and 67 
With regard to grouped dependent claims 27, 48 and 67, which recite that the scalable data 
mining functions are created by parameterizing and instantiating the analytic API, the Office Action 
asserts that these limitations are taught by Iyer. Appellants' attorney disagrees. At the indicated 
location, Iyer refers to an SQL API, not an analytic API for creating scalable data mining functions. 
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flfli'TTrt 28. 49 and 68 



With regard to grouped dependent claims 28, 49 and 68, which reche that the scalable data 
functions are dynamically generated queries comprised of combined phrases with 
substituting values therein based on parameters supplied to the analytic API, the Office Acrion 
asserts that these limitations are taught by Iyer. Appellants' attorney disagrees. At the indicated 
location, Iyer refers to an SQL API, not an analytic API for creating scalable data mining functions, 
and thus does not dynamically generate queries comprised of combined phrases with substituting 
values therein based on parameters supplied to the analytic API. 

5 Claim* 29. 5Q and 69 
With regard to grouped dependent claims 29, 50 and 69, which recite that the scalable data 
• mining functions comprise Data Description functions, Data Derivation functions, Data Reduction 
functions, Data Reorganization functions, Data Sampling functions, or Data Partitioning functions, 
these claims stand or fall with claims 28, 49 and 68, respectively. 

n^ ms ^ 51 and 70 
\7ith regard to grouped dependent claims 30, 51 and 70, which recite that the Data 
Description functions comprise descriptive statistical functions, these claims stand or fall with claims 
29, 50 and 69, respectively. 

7 ngiTns31.52and71 
With regard to grouped dependent claims 31, 52 and 71, which recite that the Data 
Description functions comprise: 

(1) descriptive statistics for one or more numeric columns, wherein the statistics are 
selected from a group comprising count, minimum, maximum, mean, standard 
deviation, standard mean error, variance, coefficient of variance, skewness, kurtosis, 
uncorrected sum of squares, corrected sum of squares, and quantiles, 

(2) a count of values for a column, 

(3) a calculated modality for a column, 

(4) one or more bin numeric columns of counts with overlay and statistics options, 
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(5) one or more automatically subsumed numeric columns giving additional counts and 
isolated frequently occurring individual values, 

(6) a computed frequency of one or more column values, 

(7) a computed frequency of values for pairs of columns in a column list, 

(8) a Pearson Product-Moment Correlation matrix, 

(9) a Covariance matrix, 

(10) a sum of squares and cross-products matrix, or 

(1 1) a count of overlapping column values in one or mote combinations of tables, 
these claims stand or fall with claims 29, 50 and 69, respectively. 



8. (3^32. 53 and 72 
With regard to grouped dependent claims 32, 53 and 72, which recite that the Data 
Derivation functions provide column derivations or transformations, the Office Action asserts th; 
these limitations are taught by Iyer. Appellants 1 attorney disagrees. At the indicated location, Iyer 
merely describes transforming table names, but says nothing about column derivations or 
transformations. 

9 naiins33.54and73 
With regard to grouped dependent claims 33, 54 and 73, which recite that the Data 
Derivation functions comprise: 

(25) a derived binned numeric column wherein a new column is bin number, 

(26) a n-valued categorical column dumrny^coded into "n* 0/ 1 values, 

(27) a n-valued categorical column recoded into n or less new values, 

(28) one or more numeric columns scaled via range transformation, 

(29) one or more columns scaled to a z-score that is a number of standard deviations 
from a mean, 

(30) one or more numeric columns scaled via a sigmoidal transforrnarion function, 

(3 1) one or more numeric columns scaled via a base 10 logarithm function, 

(32) one or more numeric columns scaled via a natural logarithm function, 

(33) one or more numeric columns scaled via an exponential function, 
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(34) one or more numeric columns raised to a specified power, 

(35) one or more numeric columns derived via user defined transformation function, 

(36) one or more new columns derived by ranking one or more columns or expressions 
based on order, 

(37) one or more new columns derived with quantile 0 to n-1 based on order and n, 

(38) a cumulative sum of a value expression based on a sort expression, 

(39) a moving average of a value expression based on a width and order, 

(40) a moving sum of a value expression based on a width and order, 

(41) a moving difference of a value expression based on a width and order, 

(42) a moving linear regression value derived from an expression, width, and order, 

(43) a multiple account/product ownership bitmap, 

(44) a product ownership bitmap over multiple time periods, 

(45) one or more counts, amount, percentage means and intensities derived from a 
transaction summary, 

(46) one or more variabilities derived from transaction summary data, 

(47) one or more derived trigonometric values and their inverses, including sin, arcsin, 
cos, arccos, esc, arccsc, sec, arcsec, tan, arctan, cot, and arccot, or 

(48) one or more derived hyperbolic values and their inverses, including sinh, arcsinh, 
cosh, arccosh, csch, arccsch, sech, arcsech, tanh, arctanh, coth, and arccoth, 

the Office Action asserts that the limitations of element (12) are taught by Iyer. Appellants' attorney 
disagrees. At the indicated location, Iyer merely describes forming groupings, not new columns 
derived by ranking one or more columns or expressions based on order. 

1 0. Claims 34. 55 and 74 
With regard to grouped dependent claims 34, 55 and 74, which recite mat the Data 
Reduction functions provide matrix bunding operations to reduce the amount of data required for 
analytic algorithms, the Office Action asserts that these limitations are taught by Iyer. Appellants' 
attorney disagrees. At the indicated location, Iyer merely describes the use of a "count matrix", but 
says nothing about matrix building operations that reduce the amount of data required for analytic 
algorithms. 
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11. Oaims 35. 56 and_75 
With regard to grouped depended claims 35, 56 and 75, which recite that the Data 
Reduction functions comprise: 

(4) build one or more data reduction matrices from a group comprising: (i) a Pearson- 
Product Moment Correlations matrix; (u) a Govariances matrix; and (iii) a Sum of Squares 
and Cross Products (SSCP) matrix, 

(5) export a resultant matrix, or 

(6) restart a matrix operation, 

These claims stand or fall with claims 29, 50 and 69, respectively. 

19 Halms 36. 57 and 76 
With regard to grouped dependent claims 36, 57 and 76, which recite that the Data 
Reorganization functions provide an ability to reorganize data by joining or de-normalizing pie- 
processed results into a wide analytic data set, these claims stand or fall with claims 29, 50 and 69, 
respectively. 

11, flaintt 37. 58 and 77 
With regard to grouped dependent claims 37, 58 and 77, which recite that the Data 
Reorganization functions comprise: 

(1) create a de-normalized new table by removing one or more key columns, or 

(2) join a plurality of tables or views into a combined result table, 
these claims stand or fall with claims 29, 50 and 69, respectively. 

1 d n *™* 18, 59 and 78 
With regaid to grouped dependent claims 38, 59 and 78, which recite that the Data Sampling 
function provides an ability to construct a new table containing a randomly selected subset of the 
rows in an existing table or view, these claims stand or fall with claims 29, 50 and 69, respectively. 
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15 Claims 39. 60 and 79 
Wkh regard to grouped dependent claims 39, 60 and 79, which recite that the Data Sample 
function selects one or more data samples of specified sizes from a table these claims stand or fall 
with claims 29, 50 and 69, respectively. 

1 A Claims 40. 61 and 80 
With regard to grouped dependent claims 40, 61 and 80, which recite that the Data 
Partitioning function provides an ability to construct a new table containing at least one randomly 
selected subset of the rows in an existing table or view, wherein the subsets are mutually distinct but 
allrinclusive subsets of data, these claims stand or fall with claims 29, 50 and 69, respectively. 

17 Claims^, 62 and 81 
With regard to grouped dependent claims 41, 62 and 81, which recite that the Data 
Partitioning function selects one or more data partitions from a table using a database internal 
hashing technique, these claims stand or fall with claims 29, 50 and 69, respectively. 

1« f^ ainns4^ M andR2 

With regard to grouped dependent claims 42, 63 and 82, which recite that results of the data 
mining operations are stored in the relational databases, these claims stand or fall with claims 24, 44 
and 45, respectively 

19. Claims 43. 64 and 83, 
With regard to grouped dependent claims 43, 64 and 83, which recite that the relational 
database management system further comprises an analytical logical data model that stores metadata 
and processing results from the Scalable Data Mining Functions, the Office Action asserts that these 
limitations are taught by Iyer. Appellants' attorney disagrees. At the indicated location, Iyer says 
nothing about an analytical logical data model that stores metadata and processing results from the 
scalable data mining functions, but instead merely refers to a training set and leaf node list. 
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IX. CONCLUSION 

. In light of the above arguments, Appellants respectfully submit that the cited references do 
not anticipate nor render obvious the claimed invention. More specifically, Appellants' claims recite 
novel physical features which patentably distinguish over any and all references under 35 U.S.C §§ 
102 and 103. As a result, a decision by the Board of Patent Appeals and Interferences reversing the 
Examiner and directing allowance of the pending claims in the subject application is respectfully 
solicited. 
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